Search CORE

4 research outputs found

Seeing the Forest for the Trees: Using the Gene Ontology to Restructure Hierarchical Clustering

Author: Dotan-Cohen Dikla
Kasif Simon
Melkman Avraham A.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 03/06/2009
Field of study

Motivation: There is a growing interest in improving the cluster analysis of expression data by incorporating into it prior knowledge, such as the Gene Ontology (GO) annotations of genes, in order to improve the biological relevance of the clusters that are subjected to subsequent scrutiny. The structure of the GO is another source of background knowledge that can be exploited through the use of semantic similarity. Results: We propose here a novel algorithm that integrates semantic similarities (derived from the ontology structure) into the procedure of deriving clusters from the dendrogram constructed during expression-based hierarchical clustering. Our approach can handle the multiple annotations, from different levels of the GO hierarchy, which most genes have. Moreover, it treats annotated and unannotated genes in a uniform manner. Consequently, the clusters obtained by our algorithm are characterized by significantly enriched annotations. In both cross-validation tests and when using an external index such as protein–protein interactions, our algorithm performs better than previous approaches. When applied to human cancer expression data, our algorithm identifies, among others, clusters of genes related to immune response and glucose metabolism. These clusters are also supported by protein–protein interaction data. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.Lynne and William Frankel Center for Computer Science; Paul Ivanier center for robotics research and production; National Institutes of Health (R01 HG003367-01A1

Boston University Institutional Repository (OpenBU)

PubMed Central

Biological context networks: a mosaic view of the interactome

Author: Cantor Charles
Cohen Dikla Dotan
Kasif Simon
Rachlin John
Publication venue
Publication date: 28/11/2006
Field of study

Network models are a fundamental tool for the visualization and analysis of molecular interactions occurring in biological systems. While broadly illuminating the molecular machinery of the cell, graphical representations of protein interaction networks mask complex patterns of interaction that depend on temporal, spatial, or condition-specific contexts. In this paper, we introduce a novel graph construct called a biological context network that explicitly captures these changing patterns of interaction from one biological context to another. We consider known gene ontology biological process and cellular component annotations as a proxy for context, and show that aggregating small process-specific protein interaction sub-networks leads to the emergence of observed scale-free properties. The biological context model also provides the basis for characterizing proteins in terms of several context-specific measures, including ‘interactive promiscuity,' which identifies proteins whose interacting partners vary from one context to another. We show that such context-sensitive measures are significantly better predictors of knockout lethality than node degree, reaching better than 70% accuracy among the top scoring proteins

Crossref

Boston University Institutional Repository (OpenBU)

PubMed Central

Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering

Author: Avraham A. Melkman
Bellazzi
Buehler
Cheng
Crocker
Curtis
Dikla Dotan-Cohen
Doherty
Dotan-Cohen
Fang
Gatenby
Hanisch
Huang
Jiang
Khatri
Kustra
Kustra
Lin
Lord
Qi
Resnik
Schlicker
Simmons
Simon Kasif
Speer
Spellman
Teschendorff
Toronen
van 't Veer
Wang
Weber
Yona
Publication venue: Oxford University Press
Publication date: 03/06/2009
Field of study

Crossref

Boston University Institutional Repository (OpenBU)

PubMed Central

Biological Process Linkage Networks

Author: A Battle
A Schlicker
A Vazquez
AC Gavin
AH Tong
AJ Butte
Avraham A. Melkman
B Schwikowski
C Stark
D Finley
D Lin
D Segre
DA Stavreva
Dikla Dotan-Cohen
E Formstecher
E Segal
E Unal
EM Marcotte
F Luo
H Hishigaki
H Jeong
I Xenarios
JL Lu
JM Stuart
KR Brown
L Giot
LA Amaral
LF Wu
M Larochelle
MA Harris
MA Huynen
P Bork
PT Spellman
PW Lord
R Kelley
R Sharan
Rodolfo Aramayo
Simon Kasif
SL Wong
Stan Letovsky
TR Hughes
U de Lichtenberg
U Karaoz
X Guo
Z Lubovac
Publication venue: Public Library of Science
Publication date: 23/04/2009
Field of study

BACKGROUND. The traditional approach to studying complex biological networks is based on the identification of interactions between internal components of signaling or metabolic pathways. By comparison, little is known about interactions between higher order biological systems, such as biological pathways and processes. We propose a methodology for gleaning patterns of interactions between biological processes by analyzing protein-protein interactions, transcriptional co-expression and genetic interactions. At the heart of the methodology are the concept of Linked Processes and the resultant network of biological processes, the Process Linkage Network (PLN). RESULTS. We construct, catalogue, and analyze different types of PLNs derived from different data sources and different species. When applied to the Gene Ontology, many of the resulting links connect processes that are distant from each other in the hierarchy, even though the connection makes eminent sense biologically. Some others, however, carry an element of surprise and may reflect mechanisms that are unique to the organism under investigation. In this aspect our method complements the link structure between processes inherent in the Gene Ontology, which by its very nature is species-independent. As a practical application of the linkage of processes we demonstrate that it can be effectively used in protein function prediction, having the power to increase both the coverage and the accuracy of predictions, when carefully integrated into prediction methods. CONCLUSIONS. Our approach constitutes a promising new direction towards understanding the higher levels of organization of the cell as a system which should help current efforts to re-engineer ontologies and improve our ability to predict which proteins are involved in specific biological processes.Lynn and William Frankel Center for Computer Science; the Paul Ivanier center for robotics research and production; National Science Foundation (ITR-048715); National Human Genome Research Institute (1R33HG002850-01A1, R01 HG003367-01A1); National Institute of Health (U54 LM008748

Public Library of Science (PLOS)

Crossref

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

PubMed Central